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The present invention relates to a novel protein, a gene encoding the protein, a transformant comprising 
the gene, and a method for producing the protein with the use of the transformant. More specifically, the 
present invention relates to a novel protein having a protease activity to convert inactive single-chain 
hepatocyte growth factor (HGF) into active two-chain HGF by cleaving the inactive HGF at a specific site, a 
5 gene encoding the protein, a transformant comprising the gene, and a method for producing the protein 
with the use of the transformant. The present invention also relates to a precursor of the protein, a gene 
encoding the precursor, a transformant comprising the gene, and a method for producing the precursor with 
the use of the transformant. 

It is known through in vitro experiments that a single-chain human HGF does not have physiological 
70 activities but two-chain human HGF has the activities. It is also known that most of the human HGFs 
produced by a gene recombinant technique with the use of a serum-free culture are single-chain form. A 
method for producing HGF with the use of a serum-free culture was preferable because the addition of 
serum to culture is expensive. 

Part of the inventors of the present invention previously discovered a protein in mammalian serum that 
75 converts such single-chain HGF into two-chain HGF. They also found that the protein had a molecular 
weight of about 34,000 dalton determined by SDS polyacrylamide gel electrophoresis (Japanese Patent 
Publication (Kokal) No. 5-103670) However, it requires considerable labor to obtain a purified product of the 
protein, because the protein exists only a very small amount in serum. Further, the protein in an active form 
is unstable in serum. 

20 As the result of our research to overcome the above disadvantages and to readily obtain a protein 

having an equivalent activity to the above-mentioned protease activity, the inventors of the present invention 
eventually obtained a gene encoding the protein, and found that the protein can be produced on a large 
scale by the use of a gene engineering technique. 

Further, the inventors attempted to purify, from human plasma, a precursor of the protein having 

25 protease activity, which converts single-chain HGF into active two-chain HGF. 

As the result, the inventors have discovered two novel precursor proteins having molecular weights of 
about 34,000 dalton and about 96,000 dalton determined by SDS polyacrylamide gel electrophoresis, which 
react with a mouse antibody against an active protease existing in human serum, and that by the treatment 
with a serine protease, the two precursor proteins are processed to have the above-mentioned protease 

30 activity. 

Further, as the result of the research for the purpose of readily obtaining proteins having equivalent 
properties as the above-mentioned precursor proteins, the inventors eventually obtained a gene encoding 
such precursor protein, and found that the proteins can be produced on a large scale by the use of a gene 
engineering technique. 

35 Thus, the present invention provides a protein characterized by the amino acid sequence of SEQ ID 

NO: 1, a gene encoding the protein, a vector that can be used to express a polypeptide encoded by the 
gene, a transformant including the vector, and a method for producing the protein with the use of the 
transformant. 

Further, the present invention provides novel protein characterized by the following physical and 
40 chemical properties: 

(i) its molecular weight is about 96,000 daiton and 34,000 daiton determined by SDS polyacrylamide gel 
electrophoresis; and 

(ii) by the treatment with a serine protease, it is processed to have a protease activity which specifically 
cleaves the human hepatocyte growth factor consisting of 728 amino acid residues at a site between 

45 arginine at the 494th position and valine at the 495th position counting from the amino terminal. 

The present invention also provides a protein characterized by the amino acid sequence of SEQ ID NO: 
12, a gene encoding the protein, a vector that can be used to express a polypeptide encoded by the gene, 
a transformant including the vector, and a method for producing the protein with the use of the transformant. 
The present invention is further illustrated below 

: i. r r - fv>w v Tht : •• Tv' tri j-< Tu " • • • • ; .r • -s.- m . • . ' • f • ::. r .••!• •(• •• • , 

.•rutease activity or me present nwerui'.>u. written i r,. u u.'pubunib n ; >. n i<n. .muni u Vv . osui c 
a cytomegalovirus (CMV) promoter. Ap' represents an ampicilhn resistance gene, i^m represent u 
neomycin resistance gene. 

bb Figure 2 presents a bar graph showing the rate of the production of two-chain HGF and single-chain HGF 
if) the presence or absence of tire precursor proteins activated by thrombin of the present invention, 

wherein 1 represents the oxpon-nontal rese t in the presence of about 100 rg ml of tne precursor protein 
uavmg a molecular weight o 4 about 90,000 dalton. 2 represents th- iesult m tnu presence <>' about 100 
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ng/ml of the precursor protein having a molecular wight of about 34,000 dalton, and 3 represents the 
result in the absence of the precursor proteins. 
A novel protein having a protease activity of the present invention (hereinafter, sometimes referred to as 
"a protein having a protease activity") includes a protein having the amino acid sequence of SEQ ID NO: 1 
5 and variants thereof having the same protease activity obtained by a deletion, replacement, modification, or 
addition of a part of amino acids of the protein having the sequence of SEQ ID NO: 1. 

Examples of a gene encoding the above-mentioned protein having a protease activity include a gene 
comprising the base sequence of SEQ ID NO: 2 as a part of the gene and a gene represented by the base 
sequence of SEQ ID NO: 3. 

io A DNA fragment including such gene can be obtained as described below. As for a cDNA library 

including DNA encoding the protein having a protease activity of the present invention, a commercially 
available cDNA library prepared from human liver can be used. Phagemids from the library are used to 
infect cells as described by Saito et al., (Proc. Natl. Acad. Sci. USA, 83, 8664-8668 (1986)) and the infected 
cells are cultured. Colonies obtained after culturing are screened by the colony hybridization method 

75 ("Molecular Cloning", Cold Spring Harbor Laboratory, 320-328 (1982)) with the use of partial DNA fragments 
or DNA fragments having base sequences that correspond to partial amino acid sequences of the protein of 
the present invention as probes and desired DNA fragments can be obtained. 

As for probes used in the colony hybridization method, DNA fragments which include parts of the gene 
encoding the protein having the protease activity of the present invention and which are prepared by the 

20 polymerase chain reaction (PCR) method (Science, 239, 487-491 (1988)) can be used. In more detail, PCR 
is carried out with the use of the DNA fragment of SEQ ID NO: 4 (corresponding to a part of the amino acid 
sequence of SEQ ID NO: 6) as a plus ( + ) strand DNA primer and the DNA fragment of SEQ ID NO: 5 
(corresponding to a part of the amino acid sequence of SEQ ID NO: 7) as a minus (-) strand DNA primer 
and the resulting DNA fragment of SEQ ID NO: 2 is used as a probe. Synthetic oligonucleotides based on 

25 DNA sequences deduced from the amino acid sequence of the protein of the present invention can also be 
used as probes. 

DNAs are then prepared from positive colonies by the method of T. Maniatis et al. ("Molecular 
Cloning", Cold Spring Harbor Laboratory, 85 (1982)). The resulting DNAs are digested with an appropriate 
restriction enzyme, e.g., BamHI, cloned in a plasmid, e.g., pUCl8 or the like, and sequenced by the 

30 dideoxy method of Sanger et al. (Proc. Natl. Acad. Sci. USA, 74, 5463 (1977)), thereby the base sequences 
of the desired DNA fragments can be determined. 

The base sequences of the DNA fragments thus determined (for example, the sequences comprising 
the base sequence of SEQ ID NO: 2 as a part and the base sequence of SEQ ID NO: 3) encode the protein 
of the present invention of SEQ ID NO: 1 . The DNA fragments of the present invention also include variants 

35 of the DNA fragments encoding the protein as long as variant-encoded polypeptides have a protease 
activity to convert single-chain HGF into two-chain HGF. Such variants are prepared by a deletion, 
replacement, modification, and addition of a part of the bases. 

Each of the DNA fragments so obtained is modified at their 5* terminal, inserted into downstream of a 
promoter in an expression vector, and introduced into a host cell, for example, E. colli, Bacillus subtilis , 

40 yeast, or an animal cell. 

A method for producing the protein of the present invention is illustrated in detail below. Preferably, an 
expression vector including a promoter at an appropriate site to transcript the DNA fragment encoding the 
protein of the present invention is used. For example, when a host cell is derived from microorganisms, 
such as E. coli, Bacillus subtilis , an expression vector comprising a promoter, ribosome binding (SD) 

45 sequence, the gene encoding the protein, transcription termination sequence, and a gene regulating the 
promoter is preferable. 

Examples of promoters used in the present invention include promoters derived from E. coli, phages or 
the like, such as promoter of tryptophan synthase (trp); rambda phage P L and P R ; and P25 and P2& 
promoters, which are promoters of early gene of T5 phage. Promoters which are modified or designed 
50 artificially, such as pac promoter (Agric. Biol. Chem., 52, 983-988 (1988)) are also useful. 

As for ribosome binding sequences, sequences derived from E. coli , phage or the like as well as 
synthetic sequences which include a consensus sequence having contiguous four or more bases of a 
sequence complimentary to a sequence at a 3' terminal region of 16S ribosome RNA can be used. 
Transcription termination sequences are not critical, but preferably, an expression vector includes 
55 lipoprotein terminator, trp operon terminator, or the like. 

Further, it is preferable that an expression vector includes the above-mentioned sequences, a promoter, 
a SD sequence, the gene encoding the protein, and a transcription termination sequence in this order from 
5' upstream. An expression vector including multiple units of SD sequence and the gene in the same 

4 



DOCID <EP_ _0596524A2..I. > 



BP 0 59G 524 A2 



orientation can also be used for increasing the number of the copies of transcriptional units in a vector 
(Japanese Patent Publication (Kokai) No. 1-95798). The method comprises a step of inserting into an 
expression vector. 

Examples of expression vectors that can be used include pUA12 (Japanese Patent Publication (Kokai) 
5 No. 1-95798), commercially available pKK233-2 (Pharmacia), and the like. Similarly, the expression vectors 
pGEX series (Pharmacia), which can express a desired protein as a fusion protein, are also useful. Standard 
methods for transforming host cells can be employed. 

Transformants can be cultured according to the method described in "Molecular Cloning" (Cold Spring 
Harbor Laboratory, 1982). 

70 As described above, host cells derived from microorganisms such as E. coli, Bacillus subtilis , yeast, 

and the like can be used, but animal cells, such as CHO cells, COS cells, mouse L cells, mouse C127 cells, 
mouse FM3A cells and the like are preferably used to express the gene encoding the protein, taking it into 
consideration that the activity of the protein of the present invention may be affected by the protein 
conformation and the sites of thiol bindings between many cysteine residues included in the protein. 

75 Among known various promoters used for animal cells, SV40 promoter, promoter of the metallothionein 

gene and the like can be used. Under such promoter, a secretion signal and the gene encoding the protein 
are inserted according to the* transcriptional orientation. A DNA fragment including two or three genes can 
also be inserted instead of a single gene. Two or three units of the gene and a promoter at its 5* side can 
be combined together and inserted into an expression vector according to the transcriptional orientation. 

20 It is preferable that the gene encoding the protein is followed by a polyadenylation site downstream of 

the gene. For example, one of the polyadenylation sites derived from SV40 DNA, £-globin gene, 
metallothionein gene, and the like is inserted downstream of the gene encoding the protein. When two to 
three units of the gene and a promoter at its 5' side are combined together, each of the gene in a unit can 
be followed by a polyadenylation site at its 3' side. Further, to attain high level of expression, it is effective 

25 to insert genes derived from animal, such as SV40 gene, rabbit 0-globin gene or the like; or splicing signal 
sequences, of intron or exon, synthesized chemically, into upstream or downstream of the gene encoding 
the protein. 

It is preferable to use selection markers when animal cells, e.g., CHO cells, are transformed with the 
above mentioned expression vectors. With the use of an expression vector including selection marker gene 

30 downstream of a polyadenylation site in the same or reverse orientation, it is not necessary to cotransform a 
cell with additional plasmid including a selection marker gene. Examples of such selection marker genes 
include a DHFR gene which confers a cell a methotrexate resistance (J. Mol. Biol., 159, 601 (1982)); a Neo 
gene which confers a cell an antibiotic G-418 resistance (J. Mol. Appl. Gene., 1, 327 (1982)); the Ecogpt 
gene derived from E. coli which confers a cell a mycophenolic acid resistance (Proc. Natl. Acad. Sci. USA, 

35 78, 2072 (1981)); a hph < gene which confers a cell an antibiotic hygromycin resistance (Mol. Cell. Biol., 5, 
410 (1985)); and the like. 

Each of these resistance genes is linked to a promoter, e.g., a promoter derived from SV40, at its 5' 
upstream and to a polyadenylation site at its 3* downstream. Commercially available expression vectors 
comprising a selection marker can be used. For example, pcDNA/Neo that comprises a neomycin 
40 resistance gene, can be used. 

When an expression vector comprising the gene encoding the protein of the present invention does not 
have any selection markers, an additional vector comprising a selection marker, e.g., pSV2Neo (J. Mol. 
Appl. Genet., 1, 327 (1982)), pMBG (Nature, 294, 228 (1981)), pSV2gpt (Proc. Natl. Acad. Sci. USA, 78, 
2072 (1981)), pAd-D26-1 (J. Mol. Biol., 159, 601 (1982)) can be used with the expression vector to 
45 cotransform a cell, thereby the resulting transformant can be selected readily by a phenotype. 

It is preferable to repeat cotransformation of the cell including the gene encoding the protein with the 
use of various selection markers, because the expression level of the protein can be increased by a 
cotransformation. 

An expression vector is introduced into an animal cell by the calcium phosphate precipitation method 

, s rsK''i miif 1 f in ma i • h ,a n. ■ uiuhou o* m« motrooc Knuwn .rtiinanv > ki ice c ?f • am. ■ \>. mam 
■ □sponsion cuKuie ana an adhesion culture. MtM medium. eiPMi Ib4u medium, una trie uku can ue useu 
and the ceils are cultured in the presence of 5 to 10 % serum or an appropriate growth factor, or in the 
absence of serum. The transformed animal cells producing the protein secrete the protein into a medium. 
bb Therefore, the protein can be separated and purified from the supernatant of the culture. In more detail, the 
supernatant of the culture including produced protein can be purified by various chromatographies, such as 
a chromatography including combination of two or more resins selected ''torn anion exchange resin, heparin 
immobilized resin, hydrophobic chromatography resin, affinity chromatography resin, and the like to obtain 
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isolated and purified protein. 

Precursor proteins of the present invention are described below. The precursor proteins of the present 
invention can be obtained by the following purification steps. The precursor protein with the molecular 
weight of about 96,000 dalton can be derived from the plasma prepared by the steps of drawing blood from 

5 a normal individual (human), adding ethylenediaminetetraacetic acid (EDTA), benzamidine, 6-amino-n- 
caproic acid n-hexyl ester p-toluensuffonate, soybean trypsin inhibitor, 6-amidino-2-naphthyl p- 
guanidinobenzoate dimethanesulfonate (naphamostat mesilate), aprotinin or the like to inhibit actions of 
coagulation factors in the blood, and removing cells by centrifugation. After the plasma is diluted with water 
to two to five volumes, the diluted plasma is chromatographed by a Heparin-sepharose column (Pharmacia) 

w or the like. The resulting fractions including the precursor protein are further chromatographed by a 
hydrophobic chromatography column (for example, with the use of Phenyl-sepharose column manufactured 
by Pharmacia). The fractions including the precursor protein are chromatographed by an antibody affinity 
chromatography column, thereby the precursor protein of the present invention can be obtained. This 
antibody affinity chromatography column is prepared by the steps of immunizing a mouse with the novel 

75 human protein described in Japanese Patent Publication (Kokai) No. 5-103670, fusing a single cell from the 
mouse spleen and a myeloma cell to obtain a hybridoma producing an antibody reactive with the 
immunized protein, and immobilizing the antibody onto appropriate resins according to a standard method. 

The precursor protein with the molecular weight of about 34,000 dalton can be derived from the plasma 
prepared by the steps of drawing blood from a normal individual (human), and adding EDTA, 6-amino-n- 

20 caproic acid n-hexyl ester p-toluensulfonate, or the like. The plasma is chromatographed by a Heparin- 
sepharose column or the like in the same manner as described above. The resulting fractions including the 
precursor protein are chromatographed by a hydrophobic chromatography column or the like in the same 
manner as described above. The resulting fractions including the precursor protein are applied onto 
aprotinin immobilized affinity column (Pentalirm) and unbounded fractions are chromatographed by the 

25 above-mentioned antibody affinity column, thereby the precursor protein can be obtained. Optionally, a step 
of purifying the proteins of the present invention includes an ion exchange column chromatography, a gel 
filtration chromatography, a hydroxyapatite column chromatography, or the like. 

The molecular weights of the purified precursor proteins of the present invention are about 96,000 
dalton and about 34,000 dalton determined by SDS polyacrylamide gel electrophoresis. They are processed 

30 to have the similar activity as that of the human protein described in Japanese Patent Publication (Kokai) 
No. 5-1 03670 by the treatment with a serine protease such as thrombin. One of the precursor proteins of the 
present invention includes the amino acid sequence of SEQ ID NO: 1 1 as a part of the protein. One of the 
precursor proteins of the present invention include the amino acid sequence of SEQ ID NO: 12 as a part of 
the protein. Also included in the scope of the present invention are variants of the precursor proteins 

35 prepared by a deletion, replacement, modification, or addition of a part of the amino acid sequences of the 
precursor proteins. 

Examples of the genes encoding the precursor proteins include a gene comprising the base sequence 
of SEQ ID NO: 13 as a part of the gene, a gene represented by the base sequence of SEQ ID NO: 14, and 
the like. 

40 A DNA fragment including such gene can be obtained by the same method for obtaining the protein 

having a protease activity as described above. 

As for probes used in the colony hybridization method, DNA fragments that include parts of the gene 
encoding the precursor proteins and that prepared by the PCR method can be used. For example, PCR is 
carried out with the use of the DNA fragment of SEQ ID NO: 4 as a plus ( + ) strand DNA primer and the 

45 DNA fragment of SEQ ID NO: 5 as a minus (-) strand DNA primer and the resulting DNA fragment of SEQ 
ID NO: 13 is used as a probe. Synthetic oligonucleotides based on DNA sequences deduced from the 
amino acid sequence of the precursor proteins of the present invention can also be used as probes. 

Further, DNAs are prepared from positive colonies and the base sequences of the desired DNA 
fragments can be determined by the method described above. 

50 The base sequences of the DNA fragments thus determined (for example, the sequences comprising 

the base sequence of SEQ ID NO: 13 as a part and the base sequence of SEQ ID NO: 14) encode the 
precursor proteins of the present invention. The DNA fragments of the present invention also include 
variants of the DNA fragments encoding the precursor proteins as long as variant-encoded polypeptides 
acquire a protease activity to convert single-chain HGF into two-chain HGF, after the polypeptides are 

55 processed. Such variants are prepared by a deletion, replacement, modification, and addition of a part of 
the bases. 

Each of the DNA fragments so obtained is modified at their 5* terminal, inserted downstream of a 
promoter in an expression vector, and introduced into a host cell, for example, E. coli , Bacillus subtilis , 
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yeast, or an animal cell by a standard method. 

Production of the precursor proteins of the present invention may be conducted in a similar manner to 
the method for producing the protein having a protease activity described above. 

The present invention is further illustrated by the following examples. However, this invention is not 
5 limited to the following examples. 

Example 1 Purification of the protein having a protease activity and determination of partial amino acid 
sequence of the same 

10 A protein having molecular weight of about 34,000 determined by SDS polyacrylamide gel elec- 

trophoresis was purified from human serum by the method described in Example 2 in Japanese Patent 
Publication (Kokai) No. 5-103670, said protein having a protease activity to convert single-chain HGF into 
two-chain HGF. The purified protein was allowed to react in Buffer A (6M guanidine hydrochloride, 0.002M 
EDTA, and 1M Tris-HCI buffer, pH 8.5) with 2-mercaptoethanol at 4 • C for 2 hours to reduce the protein. 

75 Monoiodoacetic acid at the same concentration as that of the protein was added to the resulting reaction 
solution and the mixture was allowed to react in the presence of nitrogen gas at room temperature for one 
hour in the darkness for carboxymethylation of the protein. The resulting reaction mixture was applied to the 
YMC pack C4 column (YMC). The column was then eluted with a linear gradient of 10% to 70% 
acetonitrile/isopropyl alcohol (3/7) for 30 minutes, and the fractions including the main peak were collected. 

20 The pooled fractions were mixed with 0.1% ammonium bicarbonate solution including 2M urea, and 

allowed to react with TPCK-trypsin (MILES Laboratory) or TLCK-chimotrypsin (MILES Laboratory) in a ratio 
of enzyme:substrate = 1 :50 at 37 'C for 16 hours. The resulting reaction mixture was applied to high 
pressure liquid chromatography (HPLC). The column was then eluted with a linear gradient of 0% to 80% in 
acetonitrile/isopropyl alcohol (3/7) for 1 hour, and multiple peptide fragments were obtained. 

25 The peptide fragments were dried under reduced pressure, dissolved in 60 ul of 50% trifluoroacetic 

acid, added to a glass filter treated with polybrene, and the amino acid sequence thereof was determined 
by $470A Sequencer (Applied Biosystems) employing Edman degradation. The identification of each 
phenylthiohydantoin (PTH) amino acid was performed with the use of MCI gel ODS IHU (0.46 x 15 cm) 
column (Mitsubishi Kasei Corp.). The column was eluted with acetate buffer (10mM acetate buffer, pH4.7, 

30 0.01% SDS, and 38% acetonitrile) at a flow rate of 1.2ml/min, at 43 *C, and a PTH-amino acid was detected 
by 269 nm absorbance. Among these peptides, two amino acid sequences are shown in SEG ID NO: 6 and 
7. 

Example 2 Preparation of DNA fragments including parts of the gene encoding the protein having a 
35 protease activity by the PCR method 

Commertially available human liver cDNA bank (Quickclone™ human liver cDNA, Clonetech) was used 
as a template DNA. PCR was performed by Perkin Elmer Cetus DNA Thermal Cycler with the use of Gene 
Amp DNA Amplification Reagent Kit (Takara Shuzo Co. Ltd.). First, 100 u! of reaction mixture was prepared 

40 including a template DNA (1ng), 10 ul of x10 reaction buffer (500 mM KCI, 100 mM Tris-HCI buffer, pH8.3, 
15 mM MgCb, and 0.1% (w/v) gelatin), 2 ui each of 10 mM dGTP, dATP, dCTP, and dTTP, 0.1 ul of ( + ) 
strand DNA primer of SEQ ID NO: 4 as primer #1, 0.1 ul of (-) strand DNA primer of SEQ ID NO: 5 as 
primer #2 to make final concentration of 0.1 uM, 0.5 ul of Taq DNA polymerase, and sterilized deionized 
water. Template DNA was then amplified by 30 cycles of the PCR including pretreatment at 94° C for 10 

45 minutes, denaturation at 94 °C for 1 minute, annealing at 37 'C for 2 minutes, and extension at 72 *C for 3 
minutes. The reaction was stopped by incubation at 72 • C for 7 minutes. 

The resulting reaction mixture was extracted with phenolxhroloform = 1:1, and ethanol precipitated. The 
precipitate was dissolved in 21.5 ul of sterilized deionized water. Then, 2.5 ul of x10 restriction enzyme 
buffer (50 mM Tris-HCI buffer. pH7 5 10 mM MgCb. 100 mM KCI. and 1 mM DTT) and 1 ul of restriction 

•Ml!; ■•)»■ ' i.i.: clinil ,ri KUJft ! war *■»■?•: tmi.hori >S*-" ' ' . •• ■iV'tii.'.r v laiTIK h ■ ■< - h h V". OHi if i :Sl : . " >■ >c 

. l. uarid was extracted from tnu qui uy .i sianaaiu k;(_nniqut e «.;■.... uve* a uNA naqmuM. afui rn<. waufm^i. 
was ethanol precipitated. 

The DNA fragment was inserted into pUCl9 vector at BamHI site and the base sequence thereof was 
bb identified according to a standard method. The base sequence of the identified DNA fragment prepared by 
PCR is shown in SEQ ID NO 2 
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Example 3 Screening lor a clone including a complete gene encoding the protein having a protease activity 

The fragment with 323 bp prepared as described in Example 2 was labeled with 32 P according to the 
method described in "Molecular Cloning" (Cold Spring Harbor Laboratory, 1982) and used as a probe for 

5 screening. The library to be screened was Premade Rambda Phage Library (Stratagene), which was a 
human liver cDNA library derived from 49 years old male. E.coli XL1-Blue (Stratagene) was infected with 
the phage to make about 500 milion plaques. The infected cells were cultured on NZY medium overnight 
and then transfered on Gene Screen Plus membranes (Dupont). The membranes were allowed to stand on 
paper filters soaked with 0.1 M NaOH - 0.5 M NaCI for 2 minutes, and then on paper filters soaked with 1.5 

w M NaCI-0.5 M Tris-HCI buffer (pH7.5) for 5 minutes. The memranes were further treated two more times, 
washed with 2 x SSC, and dried on a dried paper filter in the air. DNAs on the membranes were fixed with 
UV light at 120 mJ/cm 2 . 

Five membranes thus treated were soaked in 50 ml of solution including 50 mM Tris-HCI buffer, pH 7.5, 
1 M NaCI, and 1 % SDS and allowed to stand at 65 • C for 2 hours. The membranes were hybridized in 40 
75 ml of solution including 5 ng/ml probe labeled with P, 100 ug/ml salmon sperm DNA, 50 mM Tris-HCI 
buffer, pk 7.5, 1 M NaCI, and 1% SDS at 65 ° C for 16 hours. The membranes were then washed in 2 x SSC 
at room temperature for 5 minutes, 0.1 x SSC at room temperature for 30 minutes for two times, and 
autoradiographed by a standard method, thereby 40 positive clones were obtained. 

20 Examples 4 Subcioning of DNA fragments and identification of the base sequences thereof 

Plasmids were prepared directly from positive phage clones obtained in Example 3 by the excision 
method. Phage was extracted from single plaque by the treatment with 500 ul of SM buffer (50 mM Tris- 
HCI buffer, pH 7.5, 100 mM NaCI, 10 mM MgS0 4 . and 0.01 % gelatin) and 20 ul of chroloform. The mixture 

25 including 200 ul of the above-mentioned phage extracted solution, 200 ul of XL1-Blue, and 1 ul of R408 
helper phage was allowed to stand 37 W C for 15 minutes. Then, 5 ml of 2 x YT medium was added to the 
mixture and the resulting solution was shake cultured at 37 • C for 3 hours. The culture was then heated to 
70 °C for 20 minutes followed by centrifugation with 4000g for 5 minutes to obtain a supernatant. The 
supernatant was diluted to make 100-fold diluted solution. Then 20 ul of the diluted solution was mixed with 

30 200 ul of XLT-Blue and the mixture was allowed to react 37 # C for 15 minutes. Then 2 ul of the resulting 
mixture was plated onto a LB agarose medium including 40 ul/ml ampicillin. Twenty-four plasmids from the 
colonies appeared on the plate were obtained, and analyzed. The clone having the longest insert 
(pBHGFAP) was analyzed and the base sequence of the insert was identified. The base sequence thus 
determined is shown in SEQ ID NO: 3. Based on this base sequence, the amino acid sequence of the 

35 protein having a protease activity of the present invention was deduced (SEQ ID NO: 1 ). 

Example 5 Construction of an expression vector 

Figure 1 shows a construction of an expression vector including the gene encoding the protein having a 

40 protease activity of the present invention. 

The plasmid pBHGFAP prepared in Example 3 was used as a template DNA and PCR was performed 
by Perkin Elmer Cetus DNA Thermal Cycler with the use of the Gene Amp DNA Amplification Reagent Kit 
(Takara Shuzo Co., Ltd.). First, 100 ul of reaction mixture was prepared including a template DNA (0.5 ug), 
10 ul of x10 reaction buffer (500 mM KCI, 100 mM Tris-HCI buffer, pH8.3, 15 mM MgCI 2 , and 0.1% (w/v) 

45 gelatin), 16 ul each of 1.25 mM dGTP, dATP, dCTP, and dTTP, 5 ul of ( + ) strand DNA primer of SEQ ID 
NO: 8 as primer #3, 5 ul of (-) strand DNA primer of SEQ ID NO: 9 as primer #4, 0.5 ul of Taq DNA 
polymerase, and sterilized deionized water. Template DNA was then amplified by 22 cycles of the PCR 
including pretreatment at 94 "C for 10 minutes, denaturation at 94 *C for 1 minute, annealing at 52 'C for 
1 .5 minutes, and extension at 72 • C for 2 minutes. The reaction was stopped by incubation at 72 • C for 7 

50 minutes. The resulting reaction mixture was extracted with phenol: chroloform = 1 :1 , and the amplified DNA 
was ethanol precipitated. The precipitate was dissolved in 16 ul of sterilized deionized water, and 
electrophoresed using 5 % polyacrylamide gel. The 862 bp band was extracted from the gel by a standard 
technique to recover a DNA fragment and the DNA fragment was ethanol precipitated. 

The DNA fragment thus obtained was blunt-ended by a standard method with the use of T4 DNA 

55 polymerase and then cleaved with Xba l. The cleaved DNA fragments were recovered again, and 15 ng of 
the DNA fragments was mixed with 5 ng of synthetic DNA of SEQ ID NO: 10 and 20 ng of a plasmid vector 
pUC18 cleaved with Eco RI and Xba l, and the mixture was allowed to ligate by the use of Ligation Kit 
(Takara Shuzo Co. Ltd.). The resulting plasmid was transformed into competent cells, E.coli JM105 
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(Competent High, Toyobo) according to the manufacturers instruction. The transformants thus obtained 
were screened by a standard method and the plasmid pSHGFAP including the desired 926 bp insert was 
obtained. The plasmid pSHGFAP (8 ug) was cleaved with restriction enzymes Bam HI and Xhll, treated with 
phenol/chroloform, and ethanol precipitated. The precipitate was dissolved in sterilized deionized water and 

5 electrophoresed using 5 % polyacrylamide gel. The 920 bp band was extracted from the gel by a standard 
technique to recover a DNA fragment, which was then ethanol precipitated. Ten ng of the DNA fragment 
thus obtained and 5 ng of pcDNAI/Neo plasmid that was digested with BamHI and Xhll and treated with 
alkali phosphatase were ligated. The resulting plasmid was transformed into competent cells, E. coli DH5 
(Competent High, Toyobo)according to the manufacturers instruction. The transformants thus obtained were 

io screened by a standard method and the plasmid pNSHGFAP including the desired insert was obtained. 

Example 6 Expression of the protein having a protease activity in E. coli 

E. coli comprising the plasmid pSHGFAP prepared in Example 5 was inoculated in 10 ml of LB medium 
75 containg 50 ug/ml ampicillin and cultured at 37'C overnight (12 to 16 hours). Aliguot (0.1ml) of the culture 
was added to 10 ml of LB medium including 50 ug/ml ampicillin and the medium was cultured at 37 *C for 
2 hours. Isopropyl 8-D thio galactosidfi (IPTG). which is a transcriptional inducer nf lac promoter in the 
vector, was added to the culture at a final concentration of 1 mM and the culture was incubated at 37 *C for 
6 hours. Cells were collected from 1 ml of the culture by centrifugation. To confirm the expression of the 
20 protein of the present invention, a western blotting was performed. As a result, the protein expressed in the 
cells was detected. In contrast, when IPTG was not added to a culture or when the cells not including the 
expression vector pSHGFAP was cultured, the protein was not expressed. 

Example 7 Establishment of an animal cell strain expressing the protein having a protease activity of the 
25 present invention 

Plasmid pNHGFAP, which was prepared in Example 5 and includes the cDNA encoding the protein of 
the present invention at a restriction enzyme cleavage site of the expression vector pcDNAI/Neo, was 
recovered from E.coli transformants and purified according to the method of Maniatis et al. ("Molecular 

30 Cloning", Cold Spring Harbor Laboratory, 86-96 (1982)) to obtain an expression vector for the protein of the 
present invention in a large amount. 

CHO cells were transformed with the expression vector so obtained according to the method of Ausubel 
et al. (Current Protocols in Molecular Biology, Green Publishing Associates and Wiley-Inter Science, 9.1.1- 
9.1.4 (1987)). In more detail, first, CHO cells were cultured in tissue culture dish with a diameter of 9 cm 

35 containing ERDF medium (Kyokuto pharmaceiutical) supplemented by 10% fetal bovine serum (FBS) until 
cells were grown to semiconfluence. The medium was then removed from the dish and a DNA solution, 
which was prepared as described below, was dropwisely added. The DNA solution, for a single dish with a 
diameter of 9 cm, was prepared by the steps of making 570 ul mixture in an eppendorf tube including 300 
ul of 2 x HEBS solution (1.6 % NaCI, 0.074 % KCI, 0.05 % disodium hydrogenphosphate 12H 2 0, 0.2 % 

40 dextrose, and 1 % HEPES, pH7.05), 10 ug plasmid DNA, and sterilized water; dropwisely adding 30 ul of 
2 5 M CaCb solution to the mixture while the mixture was vigorously voltexed for a few second; and 
allowing the mixture to stand for 30 minutes at room temperature. 

The DNA solution thus obtained was applied to the CHO cells and the cells were allowed to stand for 
30 minutes at room temperature. Then 9 ml of ERDF medium containing 10 % FBS was added to the dish 

45 and the cells were cultured for 4 to 5 hours at 37 *C in the presence of 5 % CO2. The medium was then 
removed from the dish. The cells were washed with 5 ml of 1 x TBS + + solution (25 mM Tris-HCI buffer, 
pH 7.5, 140 mM NaCI, 5 mM KCI, 0.6 mM disodium hydrogenphosphate, 0.08 mM CaCI 2 , and 0.08 mM 
MgCb). The 1 x TBS+ + solution was then removed. Then, 5 ml of 1 x TBS + + solution including 20 % 
glycerol was added to the cells and the cells were allowed to stand for 1 to 2 minutes at room temperature 

* ,: i '* ' : .lu Jf "'• " ii'Jif!.; k H war' tii'H.lO-1 " ■ I i : ' ; in.' ' n < .• t '»n . ■ i.-il- vw ■ - i-ruM.-iS ; * r ? >• 

■leseneu '• ■ i ■■ vu. A no 1 4b nouis culture, me medium wuo removeu ami too cens were /vasneu with 
ml of I x TBS + + solution. The cells were then removed from the drsh by a trypsin treatment and the cells 
from one dish were distributed to ten dishes. Agent G418 (G418 sulfate (GENETICIN, GIBCO)) was added 
bb to each dish at a concentration of 200 ug/ml and the cells were cultured. After 10 days, vital cells, which 
resisted G418, were isolated. The cells were then cultured for 7 days in a 24 well plate including 1 ml of 
ERDF medium supplemented by 10 °o FBS Then the ceils were cultured in serum-free ERDF medium 'or 
■'2 hours and the resulting culture in each well was recovered The culture was concentrated by 
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ultrafiltration and electrophoresed by SDS acrylamide gel electrophoresis. The expression of the protein of 
the present invention was detected by western blotting. 

Example 8 Purification and amino acid sequence analysis of the precursor protein with a molecular weight 
5 of about 96,000 dalton 

Blood from a normal individual (human) were mixed with 10 mM EDTA), 10 mM benzamidine (Aldrich 
Chemical), 5 mM 6-amino-n-caproic acid n-hexyl ester p-toluensulfate (Tokyo Chemical), 2 mg/ml soybean 
trypsin inhibitor (Sigma), 50 mM naphamostat mesilate (Torii Pharmaceutical), and 500 KlU/ml aprotinin 

70 (Pentapharm). Cells were immediately removed by centrifugation to obtain plasma. After the plasma is 
diluted with water to three volumes, the diluted plasma was applied on a Heparin-sepharose column 
(pharmacia), which had been pre-equilibrated with buffer A (including 10 mM NaH 2 P04-Na 2 HP04 buffer, 
pH7, 50 mM NaCI), and the column was washed with buffer A. The column was then eluted with a linear 
gradient of 50 mM to 700 mM NaCI in Buffer A and buffer B (including 10 mM NaH 2 P04-Na 2 HPO* buffer, 

75 pH7, 700 mM NaCI). 

As described in Example 10 infra, fractions including the protein of the present invention (fractions 
eluted with about 100 mM - 250 mM NaCI) were recovered by a sandwich enzyme-linked immunosorbent 
assay (ELISA) with the use of an antibody that was prepared by immunizing a mouse with the protein 
derived from human disclosed in Japanese Patent Publication (Kokai) No: 5-103670, fusing single spleen 

20 cell from the mouse and a myeloma cell by a standard method to obtain a hybridoma, and obtaining the 
antibody from the hybridoma. The fractions were mixed with equivalent 2 M ammonium sulfate. The mixture 
was applied to a phenyl-sepharose column (Pharmacia) pre-equilibrated with 1 M ammonium sulfate. The 
column was washed with 1 M ammonium sulfate. The column was then eluted with a linear gradient of 1 M 
to 0 M ammonium sulfate. Fractions including the protein of the present invention (fractions eluted with 

25 about 400 mM - 100 mM ammonium sulfate) were recovered by ELISA. The fractions were concentrated 
with the use of an amicon ultrafiltration membrane YM-30 and buffer C (including 10 mM Tris/HCI buffer, pH 
8 t 150 mM NaCI) was added. The solution was applied to the antibody affinity column described in Example 
3 equilibrated with buffer C. The column was washed with buffer C and eluted with buffer D (50 mM 
Gfycine/HCI buffer, pH3). Fractions were collected and neutralized with buffer E (1M Tris/HCI, pH8). The 

30 resulting protein product was activated with thrombin, analyzed to determine amino acid sequence, and 
electrophoresed by SDS polyacrylamide gel electrophoresis as described below. 

To activate the purified protein, it was incubated with 1 unit/ml thrombin (Enzyme Research) in buffer C 
for 2 hours at 37 • C. The resulting activated protein was reduced with 2-mercaptoethanol in buffer F (6 M 
guanidine hydrochloride, 0.002 M EDTA, 1 M Tris-HCI buffer, pH 8.5) for 2 hours at 40 *C. Monoiodoacetic 

35 acid at the same concentration as the protein was added to the solution, and the mixture was allowed to 
react in the presence of nitrogen gas for 1 hour at room temperature in the darkness to carboxymethylate 
the protein. After the reaction, the resulting solution was applied to YMC pack C4 column (YMC). The 
column was then eluted with a linear gradient of 10 % to 70 % acetonytryl/isopropylalchol (3/7) for 20 
minutes and two major peaks were obtained. These peaks were dried under a reduced pressure, dissolved 

40 in 60 ul of 50 % thfluoroacetic acid (TFA). The solution was added to a glass filter treated with polybrene, 
and the amino acid sequence of the N terminal of the protein was determined by 470A Sequencer (Applied 
Biosystems) employing Edman degradation. The identification of each phenylthiohydantoin (PTH) amino 
acid was performed with the use of MCI gel ODS IHU (0.46 x 15 cm) column (Mitsubishi Kasei Crop.). The 
column was eluted with acetate buffer (lOmM acetate buffer, pH4.7, 0.01% SDS, and 38% acetonitrile) at a 

45 flow rate of 1.2ml/min, at 43 *C, and a PTH-amino acid was detected through 269 nm absorbance. 

The sequence analysis of the reduced carboxy methylated proteins showed that the N terminal amino 
acid sequence of one of these polypeptides has the sequence of SEQ ID NO: 1 1 . The N terminal of another 
peptide was blocked. 

so Example 9 Purification and amino acid sequence analysis of a protein with a molecular weight of about 
34,000 dalton 

Blood from a normal individual (human) was mixed with 10 mM EDTA), 10 mM benzamidine (Aldrich 
Chemical) and 5 mM 6-amino-n-caproic acid n-hexyl ester p-toluensulfonate (Tokyo Chemical). Cells were 
55 immediately removed by centrifugation to obtain plasma. The plasma was applied on a Heparin-sepharose 
column (pharmacia), which had been pre-equilibrated with buffer G (including 10 mM NaH 2 P04-Na 2 HP04 
buffer, pH7, 150 mM NaCI), and the column was washed with buffer G. The column was then eluted with a 
linear gradient of 150 mM to 700 mM NaCI in buffer G and buffer B (including 10 mM NaH 2 P04-Na2HP04 
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buffer, pH7, 700 mM NaCI). 

As described in Example 10 infra, fractions including the protein of the present invention (fractions 
eluted with about 300 mM - 450 mM NaCI) were recovered by a sandwich enzyme-linked immunosorbent 
assay (ELISA) by the use of an antibody that was prepared by immunizing a mouse with a human protein 

5 disclosed in Japanese Patent Publication (Kokai) No: 5-103670, fusing a single spleen cell from the mouse 
and a myeloma cell by a standard method to obtain a hybridoma, and obtaining the antibody from the 
hybridoma. The fractions were applied to a phenyl-sepharose column (Pharmacia) and the column was then 
eluted with a linear gradient of 1 M to 0 M ammonium sulfate as described in Example 1. Fractions 
including the protein of the present invention (fractions eluted with about 700 mM - 500 mM ammonium 

io sulfate) was recovered by ELISA. The fractions were concentrated with the use of an amicon ultrafiltration 
membrane YM-30 and buffer C (including 10 mM Tris/HCI buffer, pH 8, 150 mM NaCI) was added. The 
solution was applied to an aprotinin immobilized affinity column (Pentapharm) equilibrated with buffer C. 
The unbound fraction was applied to an antibody affinity column described in Example 3 equilibrated with 
buffer C. The column was washed with buffer C. 

75 The column was eluted with buffer D (50 mM Glycine/HCI buffer, pH3). Fractions were collected and 

neutralized with buffer E (1M Tris/HCI, pH8). The resulting protein product was activated with thrombin, 
analyzed to determine amino acid sequence, and electrophoresed by SDS polyacrylamide gel elec- 
trophoresis as described below. 

To activate the purified protein, it was incubated with 1 unit/ml thrombin (Enzyme Research) in buffer C 

20 for 2 hours at 37 * C. The resulting activated protein was reduced with 2-mercaptoethanol in buffer F (6 M 
guanidine hydrochloride, 0.002 M EDTA, 1 M Tris-HCI buffer, pH 8.5) for 2 hours at 40 °C. Monoiodoacetic 
acid at the same concentration as the protein was added to the solution, and the mixture was allowed to 
react in the presence of nitrogen gas for 1 hour at room temperature in the darkness to carboxymethylate 
the protein. After the reaction, the resulting solution was applied to YMC pack C4 column (YMC). The 

25 column was then eluted with a linear gradient of 10 % to 70 % acetonytryl/isopropylalchol (3/7) for 30 
minutes and the peaks were obtained. These peaks were dried under a reduced pressure, dissolved in 60 
ul of 50 % trifluoroacetic acid (TFA). The solution was added to a glass filter treated with polybrene, and 
the amino acid sequences of the N terminal of the proteins were determined by 470A Sequencer (Applied 
Biosystems) employing Edman degradation. The identification of each phenylthiohydantoin (PTH) amino 

30 acid was performed with the use of MCI gel ODS IHU (0.46 x 15 cm) column (Mitsubishi Kasei Corp.). The 
column was eluted with acetate buffer (10mM acetate buffer, pH4.7, 0.01% SDS, and 38% acetonitrile) at a 
flow rate of 1.2ml/min, at 43 'C, and a PTH-amino acid was detected through 269 nm absorbance. 

The sequence analysis of the reduced carboxymethylated proteins showed that the N terminal amino 
acid sequence of one of these polypeptides has the sequence of SEQ ID NO: 11. 

35 

Example 10 Preparation of an antibody, ELISA using the same, and preparation of an antibody affinity 
column 

A Balb/c mouse was immunized with a novel human protein disclosed in Japanese Patent Publication 
40 (Kokai) No. 5-103670 by a standard method. The mouse was first immunized with about 10 jig antigen with 
Freund's complete adjuvant and then boosted with about 10 ug antigen with Freund's incomplete adjuvant 
intraperitoneal^. The mouse was given a final boost from tail vein. Three days after the final boost, spleen 
cells of the mouse were taken. Single cell of the mouse spleen was fused with mouse myeloma cell P3U1 
in the presence of polyethylene glycol by a standard method. Fused cells were screened by the use of 
45 enzyme-labeled anti mouse antibodies by ELISA method employing immobilized antigens to select 
hybridomas producing antibodies against the protein. Each of the obtained hybridomas was cultured in 
serum-free medium. The supernatant of the culture was applied to a Protein A affinity column to obtain 
antibodies. Two antibodies recognizing discrete antigenic determinants were used to perform ELISA by a 
standard method The second antibody was labeled with peroxidase by a standard method. One of the 

ii tm r ■ 

Lx ample i 1 SDS polyacrylamide gel electrophoresis 

bb To determine apparent molecular weights of the proteins purified as described in Examples 8 and 9, 

SDS polyacrylamide gel electrophoresis was conducted. The perified proteins were electrophoresed by 

SDS polyacrylamide gel electrophoresis invoVing 12 5 % polyacrylamide slab gel under a non reduced 
■. ondition Moleculai makers were available ficm Phaimacta After electrophoresis, the gel was stained with 

1 1 



EP 0 596 524 A2 , 



Coomassie Brilliant Blue (CBB) R250. The protein prepared in Example 8 was 96,000 dalton and the protein 
prepared in Example 9 was 34,000 dalton determined by SDS polyacrylamide gel electrophoresis. The 
sizes of these proteins were estimated by comparing the location of the bands of the proteins to the 
location of the bands of the marker. 

5 

Example 12 Activation and assay 

To activate the proteins purified in Examples 8 and 9, these proteins were incubated with 1 unit/ml 
thrombin (Enzyme Research) in buffer C for 2 hours at 37 • C. Each of the activated proteins was added to 

w 40 ul of the solution described in the reference example in Japanese Patent Application No. 3-271362. This 
solution included 10 mM benzamidine, 100 mM NaCI, 50 mM Tris-HCI buffer, pH 8, and 5 ug of single- 
chain human HGF. After 2 hours incubation at 37 • C, SDS polyacrylamide gel electrophoresis was done on 
the mixture under a reduced condition. After electrophoresis, the gel was stained with CBB. The activity of 
the protein was determined by the ratio of single-chain HGF to two-chain HGF. The result was shown in Fig. 

75 2, wherein 1 represents the result in the presence of about 100 ng/ml protein obtained by the activation of 
the protein having a molecular weight of about 96,000 dalton, 2 represents the result in the presence of 
about 100 ng/ml protein obtained by the activation of the protein having a molecular weight of about 34,000 
dalton, and 3 represents the result in the absence of the proteins. 

20 Example 13 Purification of the precursor proteins and analysis of the N-terminal amino acid sequences of 
the proteins 

The protein, which has a molecular weight of about 96.000 dalton determined by SDS polyacrylamide 
gel electrophoresis and acquired a protease activity to convert single-chain HGF into two-chain HGF by the 

25 treatment with a serine protease, was obtained from human plasma according to the method described in 
Example 8. This purified protein was reduced with 2-mercaptoethanol in buffer A (6 M guanidine 
hydrochloride, 0.002 M EDTA, and 1 M Tris-HCI buffer, pH 8.5) for 2 hours at 40 "C. Monoiodoacetic acid at 
the same concentration as the protein was added to the solution, and the mixture was allowed to react in 
the presence of nitrogen gas for 1 hour at a room temperature in the darkness to carboxymethylate the 

30 protein. After the reaction, the resulting solution was applied to YMC pack C4 column (YMC). The column 
was then eluted with a linear gradient of 10 % to 70 % acetonytryl/isopropylalchol (3/7) for 30 minutes and 
a single major peak was detected. 

The fraction corresponding to the peak was dissolved in 0.1 % ammonium bicarbonate including 2 M 
urea. TPCK-trypsin (Miles Laboratory) or TLCK-chymotrypsin (Miles Laboratory) was added to the solution 

35 in a ratio of enzyme : substrate = 1 : 50 and the mixture was allowed to react for 16 hours at 37 *C. The 
reaction mixture was applied to HPLC and the column was eluted with a linear gradient of 0 % to 80 % 
acetonitrile/isopropyl alcohol (3/7) for 1 hour to obtain multiple peptide fragments. 

These peptide fragments were dried in the air and dissolved in 60 ul of 50 % trifluoroacetic acid (TFA). 
The solution was added to a glass filter treated with polybrene, and the amino acid sequence of the protein 

40 was determined by S470A Sequencer (Applied Biosystems) involving Edman degradation. The identification 
of phenylthiohydantoin (PTH) amino acid was performed with the use of MCI gel CDS IHU (0.46 x 15 cm) 
column (Mitsubishi Kasei Corp.). The column was eluted with acetate buffer (10mM acetate buffer, pH4.7, 
0.01% SDS, and 38% acetonitrile) at a flow rate of 1.2ml/min, at 43 *C, and a PTH-amino acid was 
detected through 269 nm absorbance. The amino acid sequences of two of these peptides are shown in 

45 SEQ ID NO:6 and 7. 

Example 14 Preparation of DNA fragment including parts of the gene encoding the precursor protein by 
PCR 

50 Commercially available human liver cDNA bank (Quickclone™ human liver cDNA, Clonetech) was used 

as a template DNA. PCR was performed by Perkin Elmer Cetus DNA Thermal Cycler with the use of Gene 
Amp DNA Amplification Reagent Kit (Takara Shuzo Co. Ltd.). First, 100 ul of reaction mixture was prepared 
including a template DNA (1ng), 10 ul of xlO reaction buffer (500 mM KCI, 100 mM Tris-HCI buffer, pH8.3, 
15 mM MgCI 2 , and 0.1% (w/v) gelatin), 2 ul each of 10 mM dGTP, dATP, dCTP, and dTTP, 2 ul of ( + ) 

55 strand DNA primer of SEQ ID NO: 4 as primer #1, 0.1 uM (-) strand DNA primer of SEQ ID NO: 5 as primer 
#2 to make final concentration to 0.1 uM, 0.5 ul of Taq DNA polymerase, and sterilized deionized water. 
Template DNA was then amplified by 30 cycles of the PCR including pretreatment at 94 *C for 10 minutes, 
denaturation at 94 *C for 1 minute, annealing at 37 *C for 2 minutes, and extention at 72 *C for 3 minutes. 
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The reaction was stopped by incubation at 72 'C for 7 minutes. The resulting reaction mixture was 
extracted with phenol:chroloform = 1 :1 , and the resulting DNAs were ethanol precipitated. The precipitate 
was dissolved in 21.5 ul of sterilized deionized water. Then, 2.5 ul of x10 restriction enzyme buffer (50 mM 
Tris-HCI buffer, pH7.5, 10 mM MgCI 2 , 100 mM KCI, and 1 mM DTT) and 1 ul of restriction enzyme Bgl ll 

5 (15 units) were added to the solution and the resulting mixture was allowed to react at 37 *C for 2 hours. 
The reaction mixture was electrophoresed using 5 % polyacrylamide gel, the 323 bp band was extracted 
from the gel by a standard technique to recover a DNA fragment, and ethanol precipitated. 

The DNA fragment was inserted into pUCl9 vector at BamHI site and the base sequence thereof was 
identified according to a standard method. The base sequence of the identified DNA fragment prepared by 

to PCR is shown in SEQ ID NO: 13. 

Example 15 Screening for a DNA fragment including complete gene encoding the precursor protein of the 
present invention 

75 The 323 bp fragment prepared as described in Example 14 was labeled with 32 P according to the 

method described in "Molecular Cloning" (Cold Spring Harbor Laboratory, 1982) and used as a probe for 
screening. Thfi library to be screened was Premade Rambda Phage Library (Stratagene), which was a 
human liver cDNA library derived from 49 years old male. E.coli XL1-Blue (Stratagene) was infected with 
the phage to make about 500 million plaques. The infected cells were cultured on NZY medium and then 

20 transferred on Gene Screen Plus membranes (Dupont). The membranes were allowed to stand on paper 
filters soaked with 0.1 M NaOH - 0.5 M NaCI for 2 minutes, and then on paper filters soaked with 1.5 M 
NaCI-0-5 M Tris-HCI buffer (pH7.5) for 5 minutes. The membranes were further treated two more times, 
washed with 2 x SSC, and dried on a dried paper filter in the air. DNAs on the membranes were fixed with 
UV light at 120 mJ/cm 2 . 

25 Five membranes thus treated were soaked in 50 ml of solution including 50 mM Tris-HCI buffer, pH 7.5, 

1 M NaCI, and 1 % SDS and allowed to stand at 65 ° C for 2 hours. The membranes were hybridized in 40 
ml of solution including 5 ng/ml P labeled probe, 100 ug/ml salmon sperm DNA, 50 mM Tris-HCI buffer, pH 
7.5, 1 M NaCI, and 1% SDS at 65 °C for 16 hours. The membranes were then washed in 2 x SSC at a room 
temperature for 5 minutes, 0.1 x SSC at a room temperature for 30 minutes for two times, and 

30 autoradiographed by a standard method, thereby positive clones were obtained. 

Examples 16 Subcloning of DNA fragments and identification of the base sequences thereof 

Plasmids were prepared directly from positive phage clones obtained in Example 15 by the excision 
35 method. Phage was extracted from single plaque by the treatment with 500 ul of SM buffer (50 mM Tris- 
HCI buffer, pH 7.5, 100 mM NaCI, 10 mM Mg 2 SCU, and 0.01 % gelatin) and 20 ul of chroloform. A mixture 
including 200 ul of the above-mentioned phage extract, 200 ul of XL1-Blue, and 1 ul of R408 helper phage 
was allowed to stand 37° C for 15 minutes. Then 5 ml of 2 x YT medium was added to the mixture and the 
resulting solution was shake cultured at 37 • C for 3 hours. The culture was then heated to 70 ■ C for 20 
40 minutes followed by centrifugation with 4000g for 5 minutes to obtain a supernatant. The supernatant was 
diluted to make 100-foid diluted solution. Then 20 ul of the diluted solution was mixed with 200 ul of XL1- 
Blue and the mixture was allowed to react at 37 a C for 15 minutes. Then 2 ul of the resulting mixture was 
plated onto a LB agarose medium including 40 ul/ml ampicillin. Twenty-four plasmids from the colonies 
which appeared on the plate were obtained and analyzed. The base sequence thus determined is shown in 
45 SEQ ID NO: 3. Based on this base sequence, the amino acid sequence of the precursor protein of the 
present invention was deduced (SEQ ID NO: 12). 

The present invention enables a stable and simple production of a protein having a protease activity to 
convert single-chain HGF to active two-chain HGF and the precursor proteins thereol in a large amount. 
Those proteins can be used as a regulator in producing active HGF 
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SEQ ID NO: 1 

SEQUENCE TYPE: amino acid 
SEQUENCE LENGTH: 300 
STRANDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE 
ORGANISM: human 



Ala Leu Ser Trp Glu Tyr Cys Arg Leu Glu Ala Cys Glu Ser Leu Thr 

20 

15 10 15 

Arg Val Gin Leu Ser Pro Asp Leu Leu Ala Thr Leu Pro Glu Pro Ala 
25 20 25 30 

Ser Pro Gly Arg Gin Ala Cys Gly Arg Arg His Lys Lys Arg Thr Phe 
35 40 45 

30 Leu Arg Pro Arg lie lie Gly Gly Ser Ser Ser Leu Pro Gly Ser His 
50 55 60 

Pro Trp Leu Ala Ala lie Tyr lie Gly Asp Ser Phe Cys Ala Gly Ser 

35 

65 70 75 80 

Leu Val His Thr Cys Trp Val Val Ser Ala Ala His Cys Phe Ser His 
40 85 90 95 

Ser Pro Pro Arg Asp Ser Val Ser Val Val Leu Gly Gin His Phe Phe 
100 105 110 

45 Asn Arg Thr Thr Asp Val Thr Gin Thr Phe Gly lie Glu Lys Tyr lie 
115 120 125 

Pro Tyr Thr Leu Tyr Ser Val Phe Asn Pro Ser Asp His Asp Leu Val 

50 

130 135 140 



55 
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Leu He Arg Leu Lys Lys Lys Gly Asp Arg Cys Ala Thr Arg Ser Gin 
145 150 155 160 

Phe Val Gin Pro He Cys Leu Pro Glu Pro Gly Ser Thr Phe Pro Ala 

165 170 175 

Gly His Lys Cys Gin He Ala Gly Trp Gly His Leu Asp Glu Asn Val 

180 185 190 

Ser Gly Tyr Ser Ser Ser Leu Arg Glu Ala Leu Val Pro Leu Val Ala 

195 200 205 

Asp His Lys Cys Ser Ser Pro Glu Val Tyr Gly Ala Asp He Ser Pro 

2i0 215 220 

Asn Met Leu Cys Ala Gly Tyr Phe Asp Cys Lys Ser Asp Ala Cys Gin 
225 230 235 240 

Gly Asp Ser Gly Gly Pro Leu Ala Cys Glu Lys Asn Gly Val Ala Tyr 

245 250 255 

Leu Tyr Gly He He Ser Trp Gly Asp Gly Cys Gly Arg Leu His Lys 

260 265 270 

Pro Gly Val Tyr Thr Arg Val Ala Asn Tyr Val Asp Trp lie Asn Asp 

275 280 285 

Arg He Arg Pro Pro Arg Arg Leu Val Ala Pro Ser 
290 295 300 



SEQ ID NO: 2 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 329 
STRANDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
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ORGANISM: human 

IMMEDIATE EXPERIMENTAL SOURCE: Quick-cloneTM human liver cDNA 
5 (Clonetech) 

GGATCC CAG ATT GCG GGC TGG GGC CAC TTG GAT GAG AAC GTG AGC GGC 48 
w Gin lie Ala Gly Trp Gly His Leu Asp Glu Asn Val Ser Gly 

15 10 
TAC TCC AGC TCC CTG CGG GAG GCC CTG GTC CCC CTG GTC GCC GAC CAC 96 
Tyr Ser Ser Ser Leu Arg Glu Ala Leu Val Pro Leu Val Ala Asp His 
15 20 25 30 

AAG TGC AGC AGC CCT GAG GTC TAC GGC GCC GAC ATC AGC CCC AAC ATG 144 
Lys Cys Ser Ser Pro Glu Val Tyr Gly Ala Asp lie Ser Pro Asn Met 

35 40 45 

CTC TGT GCC GGC TAC TTC GAC TGC AAG TCC GAC GCC TGC CAG GGG GAC 192 
Leu Cys Ala Gly Tyr Phe Asp Cys Lys Ser Asp Ala Cys Gin Gly Asp 

50 55 60 

TCA GGG GGG CCC CTG GCC TGC GAG AAG AAC GGC GTG GCT TAC CTC TAC 24 0 
Ser Gly Gly Pro Leu Ala Cys Glu Lys Asn Gly Val Ala Tyr Leu Tyr 

65 70 75 

GGC ATC ATC AGC TGG GGT GAC GGC TGC GGG CGG CTC CAC AAG CCG GGG 288 
Gly lie lie Ser Trp Gly Asp Gly Cys Gly Arg Leu His Lys Pro Gly 

80 85 90 

GTC TAC ACC CGC GTG GCC AAC TAT GTG GAC TGG AT GGATCC 329 
Val Tyr Thr Arg Val Ala Asn Tyr Val Asp Trp 
95 100 105 



25 



30 



35 



40 



45 



50 



SEQ ID NO: 3 

SEQUENCE TYPE: nucleic acid 
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SEQUENCE LENGTH: 970 
STRANDNESS: double 
5 TOPOLOGY: linear 

MOLECULE TYPE: CDNA 
ORIGINAL SOURCE 
ORGANISM: human 

IMMEDIATE EXPERIMENTAL SOURCE: Pre-made Lambda phage Library, 
human liver (49, male) cDNA Library (Stratagene) 







CTGGGAGTAC 


TGCCGCCTGG 


AGGCCTGCGA 


ATCCCtCaCC 


AGAGTCCAAC 


60 


20 


TGTCACCGGA 


TCTCCTGGCG 


ACCCTGCCTG 


AGCCAGCCTC 


CCCGGGGCGC 


CAGGCCTGCG 


120 


GCAGGAGGCA 


CAAGAAGAGG 


ACGTTCCTGC 


GGCCACGTAT 


CATCGGCGGC 


TCCTCCTCGC 


180 




TGCCCGGCTC 


GCACCCCTGG 


CTGGCCGCCA 


TCTACATCGG 


GGACAGCTTC 


TGCGCCGGGA 


240 


25 


GCCTGGTCCA 


CACCTGCTGG 


GTGGTGTCGG 


CCGCCCACTG 


CTTCTCCCAC 


AGCCCCCCCA 


300 




GGGACAGCGT 


CTCCGTGGTG 


CTGGGCCAGC 


ACTTCTTCAA 


CCGCACGACG 


GACGTGACGC 


360 




AG AC C TTCGG 


CATCGAGAAG 


TACATCCCGT 


ACACCCTGTA 


CTCGGTGTTC 


AACCCCAGCG 


420 


30 


ACCACGACCT 


CGTCCTGATC 


CGGCTGAAGA 


AGAAAGGGGA 


CCGCTGTGCC 


ACACGCTCGC 


480 




AGTTCGTGCA 


GCCCATCTGC 


CTGCCCGAGC 


CCGGCAGCAC 


CTTCCCCGCA 


GGACACAAGT 


540 




GCCAGATTGC 


GGGCTGGGGC 


CAC TTGGATG 


AG AAC GTGAG 


CGGCTACTCC 


AGCTCCCTGC 


600 


35 


GGGAGGCCCT 


GGTCCCCCTG 


GTCGCCGACC 


ACAAGTGCAG 


CAGCCCTGAG 


GTCTAC GGCG 


660 




CCGACATCAG 


CCCCAACATG 


CTCTGTGCCG 


GCTACTTCGA 


CTGCAAGTCC 


GACGCCTGCC 


720 




AGGGGGACTC 


AGGGGGGCCC 


CTGGCCTGCG 


AG AAGAAC GG 


CGTGGCTTAC 


CTCTACGGCA 


780 


40 


TCATCAGCTG 


GGGTGACGGC 


TGCGGGCGGC 


TCCACAAGCC 


GGGGG TCTAC 


ACCCGCGTGG 


840 




CCAACTATGT 


GGACTGGATC 


AACGACCGGA 


TACGGCCTCC 


CAGGCGGCTT 


GTGGCTCCCT 


900 




CCTGACCCTC 


CAGCGGGACA 


CCCTGGTTCC 


CACCATTCCC 


TGCCTTGCTG 


ACAATAAAGA 


960 


45 


TATTTCCAAG 












970 



SEQ ID NO: 4 
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SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 26 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
ORIGINAL SOURCE 
ORGANISM: human 

CTCGGATCCC ARATNGCNGG NTGGGG 

R : G or A, N : A, G, C or T 

SEQ ID NO: 5 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 26 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
ORIGINAL SOURCE 
ORGANISM: human 

CTCGGATCCA TCCARTCNAC RTARTT 

R : G or A, N : A, G, C or T 

SEQ ID NO: 6 

SEQUENCE TYPE: amino acid 

SEQUENCE LENGTH: 7 

TOPOLOGY: linear 

MOLECULE TYPE: peptide 

FRAGMENT TYPE: internal fragment 

ORIGINAL SOURCE 



18 
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ORGANISM: human 

Cys Gin lie Ala Gly Trp Gly 
1 5 

SEQ ID NO: 7 

SEQUENCE TYPE: amino acid 

SEQUENCE LENGTH r 8 

TOPOLOGY: linear 

MOLECULE TYPE: peptide 

FRAGMENT TYPE: internal fragment 

ORIGINAL SOURCE 

ORGANISM: human 

Val Ala Asn Tyr Val Asp Trp lie 
1 5 

SEQ ID NO: 8 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 19 
TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic 
ORIGINAL SOURCE 
ORGANISM: human 

GTC CAACTGT CACCGGATC 
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SEQUENCE TYPE: nucleic acid 

SEQUENCE LENGTH: 19 
5 TOPOLOGY: linear 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 

ORIGINAL SOURCE 
*° ORGANISM: human 

CGCTCGAGGG TCAGGAGGG 19 

SEQ ID NO: 10 

SEQUENCE TYPE: nucleic acid 
20 SEQUENCE LENGTH: 71 

STRANDNESS : double 
TOPOLOGY: linear: 

MOLECULE TYPE: Other nucleic acid (synthetic DNA) 
ORIGINAL SOURCE 
ORGANISM: human 

30 

AATT CGGATC CATGAAGGTT CTGTGGGCTG CGTTGCTGGT CACATTCCTG GCAGGATGCC 60 
GCCTAG GTACTTCCAA GACACCCGAC GCAACGAC C A GTGTAAGGAC CGTCCT AC GG 

35 

AGGC CAAGGT G 71 
TC CGGTTCC A C 

40 

SEQ ID NO: 11 

SEQUENCE TYPE: amino acid 

SEQUENCE LENGTH: 8 

45 

TOPOLOGY : linear 
MOLECULE TYPE: protein 

50 
55 
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FRAGMENT TYPE: partial peptide 
ORIGINAL SOURCE 
ORGANISM: human 
TISSUE TYPE: plasuma 

lie lie Gly Gly Ser Ser Ser Leu 
1 5 8 

SEQ ID NO: 12 
SEQUENCE TYPE: amino acid 
SEQUENCE LENGTH: 655 
STRANDEDNESS : single 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
ORIGINAL SOURCE 
ORGANISM: human 

Met Gly Arg Trp Ala Trp Val Pro Ser Pro Trp Pro Pro Pro Gly Leu 

15 10 15 

Gly Pro Phe Leu Leu Leu Leu Leu Leu Leu Leu Leu Leu Pro Arg Gly 

20 25 30 

Phe Gin Pro Gin Pro Gly Gly Asn Arg Thr Glu Ser Pro Glu Pro Asn 

35 40 45 

Ala Thr Ala Thr Pro Ala lie Pro Thr lie Leu Val Thr Ser Val Thr 

50 55 60 

Ser Glu Thr Pro Ala Thr Ser Ala Pro Glu Ala Glu Gly Pro Gin Ser 
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Gly Gly Leu Pro Pro Pro Pro Arg Ala Val Pro Ser Ser Ser Ser Pro 

85 90 95 

5 

Gin Ala Gin Ala Leu Thr Glu Asp Gly Arg Pro Cys Arg Phe Pro Phe 

100 105 110 

Arg Tyr Gly Gly Arg Met Leu His Ala Cys Thr Ser Glu Gly Ser Ala 

1 o 

115 120 125 

His Arg Lys Trp Cys Ala Thr Thr His Asn Tyr Asp Arg Asp Arg Ala 
15 130 135 140 

Trp Gly Tyr Cys Val Glu Ala Thr Pro Pro Pro Gly Gly Pro Ala Ala 
145 150 155 160 

20 Leu Asp Pro Cys Ala Ser Gly Pro Cys Leu Asn Gly Gly Ser Cys Ser 

165 170 175 

Asn Thr Gin Asp Pro Gin Ser Tyr His Cys Ser Cys Pro Arg Ala Phe 

25 

180 185 190 

Thr Gly Lys Asp Cys Gly Thr Glu Lys Cys Phe Asp Glu Thr Arg Tyr 
195 200 205 

30 

Glu Tyr Leu Glu Gly Gly Asp Arg Trp Ala Arg Val Arg Gin Gly His 
210 215 220 

35 Val Glu Gin Cys Glu Cys Phe Gly Gly Arg Thr Trp Cys Glu Gly Thr 
225 230 235 240 

Arg His Thr Ala Cys Leu Ser Ser Pro Cys Leu Asn Gly Gly Thr Cys 
40 245 250 255 

His Leu lie Val Ala Thr Gly Thr Thr Val Cys Ala Cys Pro Pro Gly 
260 265 270 

45 

Phe Ala Gly Arg Leu Cys Asn lie Glu Pro Asp Glu Arg Cys Phe Leu 
275 280 285 

5o Gly Asn Gly Thr Gly Tyr Arg Gly Val Ala Ser Thr Ser Ala Ser Gly 
290 295 300 



55 
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Leu Ser Cys Leu Ala 
305 

Val Asp Ser Val Gly 
325 

Tyr Cys Arg Asn Pro 
340 

Lys Asp Ser Ala Leu 
355 

Ser Leu Thr Arg Val 
370 

Glu Pro Ala Ser Pro 

385 

Arg Thr Phe Leu Arg 
405 

Gly Ser His Pro Trp 
420 

Ala GLy Ser Leu Val 
435 

Phe Ser His Ser Pro 
450 

His Phe Phe Asn Arg 
465 

Lys Tyr lie Pro Tyr 
485 

Asp Leu Val Leu lie 
500 



Trp Asn Ser Asp Leu 
310 

Ala Ala Ala Leu Leu 
330 

Asp Asn Asp Glu Arg 
345 

Ser Trp Glu Tyr Cys 
360 

Gin Leu Ser Pro Asp 
375 

Gly Arg Gin Ala Cys 
390 

Pro Arg lie lie Gly 
410 

Leu Ala Ala lie Tyr 
425 

His Thr Cys Trp Val 
440 

Pro Arg Asp Ser Val 
455 

Thr Thr Asp Val Thr 
470 

Thr Leu Tyr Ser Val 
490 

Arg Leu Lys Lys Lys 
505 



Leu Tyr Gin Glu Leu His 
315 320 
Gly Leu Gly Pro His Ala 
335 

Pro Trp Cys Tyr Val Val 
350 

Arg Leu Glu Ala Cys Glu 
365 

Leu Leu Ala Thr Leu Pro 
380 

Gly Arg Arg His Lys Lys 
395 400 
Gly Ser Ser Ser Leu Pro 
415 

lie Gly Asp Ser Phe Cys 
430 

Val Ser Ala Ala His Cys 
445 

Ser Val Val Leu Gly Gin 
460 

Gin Thr Phe Gly lie Glu 
475 480 
Phe Asn Pro Ser Asp His 
495 

Gly Asp Arg Cys Ala Thr 
510 
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w 



20 



25 



30 



40 



45 



Phe Pro Ala Gly His Lys Cys Gin He Ala Gly Trp Gly His Leu Asp 

530 535 540 

Glu Asn Val Ser Gly Tyr Ser Ser Ser Leu Arg Glu Ala Leu Val Pro 
545 550 555 560 

Leu Val Ala Asp His Lys Cys Ser Ser Pro Glu Val Tyr Gly Ala Asp 

565 570 575 

He Ser Pro Asn Met Leu Cys Ala Gly Tyr Phe Asp Cys Lys Ser Asp 

580 585 590 

Ala Cys Gin Gly Asp Ser Gly Gly Pro Leu Ala Cys Glu Lys Asn Gly 

595 600 605 

Val Ala Tyr Leu Tyr Gly lie lie Ser Trp Gly Asp Gly Cys Gly Arg 

610 615 620 

Leu His Lys Pro Gly Val Tyr Thr Arg Val Ala Asn Tyr Val Asp Trp 
625 630 635 640 

He Asn Asp Arg He Arg Pro Pro Arg Arg Leu Val Ala Pro Ser 

645 650 655 



SEQ ID NO: 13 
35 SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 329 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
ORIGINAL SOURCE 
ORGANISM: human 

IMMEDIATE EXPERIMENTAL SOURCE: Quick-cloneTM human liver cDNA 
50 (Clonetech) 
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15 



GGATCC CAG ATT GCG GGC TGG GGC CAC TTG GAT GAG AAC GTG AGC GGC 48 
Gin lie Ala Gly Trp Gly His Leu Asp Glu Asn Val Ser Gly 
5 1 5 10 

TAC TCC AGC TCC CTG CGG GAG GCC CTG GTC CCC CTG GTC GCC GAC CAC 9 6 

Tyr Ser Ser Ser Leu Arg Glu Ala Leu Val Pro Leu Val Ala Asp His 
io 15 20 25 30 

AAG TGC AGC AGC CCT GAG GTC TAC GGC GCC GAC ATC AGC CCC AAC ATG 144 
Lys Cys Ser Ser Pro Glu Val Tyr Gly Ala Asp lie Ser Pro Asn Met 

35 40 45 

CTC TGT GCC GGC TAC TTC GAC TGC AAG TCC GAP- GCC TGC CAG GGG GAC 19? 
Leu Cys Ala Gly Tyr Phe Asp Cys Lys Ser Asp Ala Cys Gin Gly Asp 

50 55 60 

TCA GGG GGG CCC CTG GCC TGC GAG AAG AAC GGC GTG GCT TAC CTC TAC 240 
Ser Gly Gly Pro Leu Ala Cys Glu Lys Asn Gly Val Ala Tyr Leu Tyr 

65 70 75 

GGC ATC ATC AGC TGG GGT GAC GGC TGC GGG CGG CTC CAC AAG CCG GGG 28 8 
Gly lie lie Ser Trp Gly Asp Gly Cys Gly Arg Leu His Lys Pro Gly 

80 85 90 

GTC TAC ACC CGC GTG GCC AAC TAT GTG GAC TGG AT GGATCC 329 
Val Tyr Thr Arg Val Ala Asn Tyr Val Asp Trp 
95 100 105 



20 



25 



30 



35 



40 SEQ ID NO: 14 

SEQUENCE TYPE: nucleic acid 
SEQUENCE LENGTH: 2033 

45 STRANDEDNESS : double 

TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
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ORIGINAL SOURCE 
ORGANISM: human 

IMMEDIATE EXPERIMENTAL SOURCE: Pre-made Lambda phage Library, 
human liver (49, male) cDNA Library (Stratagene) 

w ATGGGGCGCT GGGCCTGGGT CCCCAGCCCC TGGCCCCCAC CGGGGCTGGG CCCCTTCCTC 60 
CTCCTCCTCC TGCTGCTGCT GCTGCTGCCA CGGGGGTTCC AGCCCCAGCC TGGCGGGAAC 120 
CGTACGGAGT CCCCAGAACC TAATGCCACA GCGACCCCTG CGATCCCCAC TATCCTGGTG 180 

15 ACCTCTGTGA CCTCTGAGAC CCCAGCAACA AGTGCTCCAG AGGCAGAGGG ACCCCAAAGT 240 
GGGGGGCTCC CGCCCCCGCC CAGGGCAGTT CCCTCGAGCA GTAGCCCCCA GGCCCAAGCA 3 00 
CTCACCGAGG ACGGGAGGCC CTGCAGGTTC CCCTTCCGCT ACGGGGGCCG CATGCTGCAT 3 60 

20 GCCTGCACTT CGGAGGGCAG TGCACACAGG AAGTGGTGTG CCACAACTCA CAACTACGAC 4 20 
CGGGACAGGG CCTGGGGCTA CTGTGTGGAG GCCACCCCGC CTCCAGGGGG CCCAGCTGCC 4 80 
CTGGATCCCT GTGCCTCCGG CCCCTGCCTC AATGGAGGCT CCTGCTCCAA TACC CAGGAC 54 0 

25 CCCCAGTCCT ATCACTGCAG CTGCCCCCGG GCCTTCACCG GCAAGGACTG CGGCACAGAG 6 00 
AAATGCTTTG ATGAGACCCG CTACGAGTAC CTGGAGGGGG GCGACCGCTG GGCCCGCGTG 660 
CGCCAGGGCC ACGTGGAACA GTGCGAGTGC TTCGGGGGCC GGACCTGGTG CGAAGGCACC 7 20 

30 CGACATACAG CTTGTCTGAG CAGCCCTTGC CTGAACGGGG GCACCTGCCA CCTGATCGTG 7 80 
GCCACCGGGA CCACCGTGTG TGCCTGCCCA CCAGGCTTCG CTGGACGGCT CTGCAACATC 840 
GAGCCTGATG AGCGCTGCTT CTTGGGGAAC GGCACTGGGT AC CGTGGCGT GGCCAGCACC 900 

35 TCAGCCTCGG GCCTCAGCTG CCTGGCCTGG AACTCCGATC TGCTCTACCA GGAGC TGCAC 960 
GTGGACTCCG TGGGCGCCGC GGCC CTGCTG GGCC TGGGCC CCCATGCCTA CTGCCGGAAT 1020 
CCGGACAATG ACGAGAGGCC CTGGTGC TAC GTGGTGAAGG ACAGCGCGCT CTCCTGGGAG 1080 

40 TACTGCCGCC TGGAGGCCTG CGAATCCCTC ACCAGAGTCC AACTGTCACC GGATCTCCTG 114 0 
GCGACCCTGC CTGAGCCAGC CTCCCCGGGG CGCCAGGCCT GCGGCAGGAG GCACAAGAAG 1200 
AGGACGTTCC TGCGGCCACG TATCATCGGC GGCTCCTCCT CGCTGCCCGG CTCGCACCCC 126 0 

45 TGGCTGGCCG C CATC TACAT CGGGGACAGC TTCTGCGCCG GGAGC CTGGT CCACACCTGC 1320 
TGGGTGGTGT CGGCCGCCCA CTGCTTCTCC CACAGCCCCC CCAGGGACAG CGTCTCCGTG 1380 

50 

55 
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5 



10 



75 



GTGCTGGGCC 


AGCACTTCTT 


CAACCGCACG 


ACGGACGTGA 


CGCAGACCTT 


CGGCATCGAG 


1440 


AAGTACATCC 


CGTACACCCT 


GTACTCGGTG 


TTCAACCCCA 


GCGACCACGA 


CCTCGTCCTG 


1500 


ATCCGGCTGA 


AGAAGAAAGG 


GGACCGCTGT 


GCCACACGCT 


CGCAGTTCGT 


GCAGCCCATC 


1560 


TGCCTGCCCG 


AGCCCGGCAG 


CACCTTCCCC 


GCAGGACACA 


AGTGCCAGAT 


TGCGGGCTGG 


1620 


GGCCACTTGG 


ATGAGAACGT 


GAGCGGCTAC 


TCCAGCTCCC 


TGCGGGAGGC 


CCTGGTCCCC 


1680 


CTGGTCGCCG 


AC C ACAAGTG 


CAGCAGCCCT 


GAGGTC TACG 


GCGCCGACAT 


CAGCCCCAAC 


1740 


ATGCTCTGTG 


CCGGCTACTT 


CGACTGCAAG 


TCCGACGCCT 


G C CAGGGGG A 


CTCAGGGGGG 


1800 


CCCCTGGCCT 


GCGAGAAGAA 


CGGCGTGGCT 


TACC TCTACG 


GCATCATCAG 


CTGGGGTGAC 


1860 


GGCTGCGGGC 


GGCTCCACAA 


GCCGGGGGTC 


TACACCCGCG 


TGGCCAACTA 


TGTGGAC TGG 


1920 


ATCAACGACC 


GGATACGGCC 


TCCCAGGCGG 


CTTGTGGCTC 


CCTCCTGACC 


CTCCAGCGGG 


1380 


ACACCCTGGT 


TCCCACCATT 


CCCTGCCTTG 


CTGACAATAA 


AGATATTTCC 


AAG 


2033 
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Claims 

25 1. A protein having the amino acid sequence of SEQ ID NO: 1. 

2. A gene encoding the protein according to claim 1 . 

3. The gene according to claim 2, wherein said gene comprises the base sequence of SEQ ID NO: 2 as a 
30 part of the gene. 

4. The gene according to claim 2, wherein said gene has the base sequence of SEQ ID NO: 3. 

5. An expression vector comprising the gene according to claim 2, wherein said gene is positioned 
35 downstream of a promoter so as to express a polypeptide encoded by the gene. 

6. A transformant obtained by transformation of a host cell with the vector according to claim 5. 

7. A method for producing a protein having the amino acid sequence of SEQ ID NO: 1 comprising a step 
40 of culturing the transformant according to claim 6. 

8. A method for producing a hepatocyte growth factor with the use of the protein according to claim 1 as 
a regulator. 

45 9. Novel protein characterized by the following physical and chemical properties: 

(i) its molecular weight is about 96,000 dalton or 34,000 dalton determined by SDS polyacrylamide 
gel electrophoresis; and 

(H) by the treatment with a serine protease, it acquires a protease activity to specifically cleave the 
human hepatocyte growth factor consisting of 728 amino acid residues at a site between arginine at 

NO: I l as a part ot the protein. 

bb 11. A method for producing a hepatocyte growth factor with the use of the protein according to claim 9 as 
a regulator. 

12. A protein having the amno a-,, id sequorn o . =t 3CQ D NO 1? 

27 
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13. A gene encoding the protein according to claim 12. 

14. The gene according to claim 13, wherein said gene comprises the base sequence of SEQ ID NO: 13 as 
a part of the gene. 

5 

15. The gene according to claim 13, wherein said gene has the base sequence of SEQ ID NO: 14. 

16. An expression vector comprising the gene according to claim 13, wherein said gene is positioned 
downstream of a promoter so as to express a polypeptide encoded by the gene. 

w 

17. A transformant obtained by transformation of a host cell with the vector according to claim 16. 

18. A method of producing a protein having the amino acid sequence of SEQ ID NO: 12 comprising a step 
of culturing the transformant according to claim 17. 

75 

19. A method of producing a hepatocyte growth factor with the use of the protein according to claim 12 as 
a regulator. 

20 
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35 



40 



45 
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Fig 1 
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Fig 2 
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